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ABSTRACT 



A system and method for queuing, control and transfer 
of data between a host processor and a peripheral pro- 
cessor using an architecture and a data flow strategy of 
one or more virtual FIFO data structures stored in main 
memory and a hardware FIFO under control of the 
host and peripheral processors. One virtual FIFO at a 
time drives the data FIFO with data. In turn, the data 
FIFO drives a peripheral device with this data. The 
host software ninning on the digital processor controls 
the loading of data for each process (context) into its 
associated virtual FIFO. The host processor controls 
the operation of the peripheral processor and the virtual 
FIFOs. The peripheral processor controls the flow of 
data from the data FIFO to the peripheral device, and 
under control of the host software, the flow of data 
from the driving virtual FIFO to the data FIFO. Start 
and end address pointers for each virtual FIFO stored 
in associated memory block indicate the memory loca- 
tion in the virtual FIFO where data is stored. The pe- 
ripheral processor also keeps a start address pointer of 
the memory location of the last unused data then read 
out of the data FIFO to the peripheral device. These 
address pointers allow the hardware FIFO to be flushed 
when a context switch occurs, which can take place 
before all of the data in the driving virtual FIFO is 
supplied to the data FIFO, and supplied by the data 
FIFO to the peripheral device. 

21 Claims, 14 Drawing Sheets 
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1 2 

First peripheral 130, second peripheral 132, and third 

SYSTEM AND METHOD FOR TRANSFERRING peripheral 134 can either receive data and instructions 
DATA BETWEEN A PLURALITY OF VIRTUAL from digital processor 126, receive instructions from 
FIFO'S AND A PERIPHERAL VIA A HARDWARE and send read data to the digital processor 126, or re- 
FIFO AND SELECTIVELY UPDATING CONTROL 5 ceive instructions from and send to and receive data 
INFORMATION ASSOCIATED WITH THE from the digital processor 126. 

VIRTUAL FIFO'S i n the conventional system and method of FIG. 1, the 

data and instructions are received from main memory 

BACKGROUND OF THE INVENTION 102 via a CPU-memory bus 122 and a bus adapter 120 to 

1. Field of the Invention 10 ™ I/O bus 110. 

The present invention relates generally to the queu- Th e architecture of FIG. 1 can also be applied to 

ing, control and transfer of data between a host proces- systems with multiple memory devices 101 and/or I/O 

sor and a peripheral processor, and more particularly, buses 110. 

to first-in-first-out (FIFO) systems and methods be- FIG. 2 shows another conventional architecture and 

tween a host processor and a peripheral processor. method, where the peripherals are connected directly 

2. Related Art to cache 124. Specifically, I/O controllers 106, 112 and 
While input/output historically has been the orphan H<5 are connected via the I/O bus 110 and bus adapter 

of computer architecture, it is absolutely critical to the 120 ^ a bus 204 to cache 124, which is connected to 
performance of any traditional class of computers: 2Q CPU-memory bus 122, and to CPU 126 via a translation 
mainframe, minicomputer, workstation, file server, and look aside buffer (TLB) 202. Like the conventional 
personal computer. As stated by John L. Hennessy and system of FIG. 1, the streams of instructions and data 
David A. Patterson in their seminal book Computer for the peripherals in the system of FIG. 2 may be 
Architecture A Quantitative Approach, Morgan Kauf- placed in a deep hardware buffer such as a FIFO as part 
mann Publishers, Inc., Palo Alto, Calif., U.SJV-, 1990, p. 25 of the I/O controller, or as a software data structure 
499 (which is incorporated herein by reference): stored in main memory 102. 
A computer without I/O devices is like a car without A conventional system and method for transferring 
wheels— you can't get very far without them. And data between a digital processor and a peripheral pro- 
while CPU time is interesting, response time— the cessor utilizing virtual direct memory access (DMA) is 
time between when the user types a command and 30 shown in block architectural form in FIG. 3. The vir- 
when she gets results — is surely a better measure of tual DMA requires a register for each page to be trans- 
performance. The customer who pays for a com- ferred in a DMA controller 302, showing the protection 
puter cares about response time, even if the CPU bits and the physical page corresponding to each virtual 
designer doesn't. Finally, as rapid improvements in page. Address-translation registers 304 connect DMA 
CPU performance compress traditional classes of 35 302 to CPU-memory bus 122. 

computers, it is I/O that serves to distinguish them. Note that the architecture in FIG. 3 (and for that 
"Smart" peripheral devices have internal processing matter FIGS. 1 and 2) can also be implemented without 
functionality. Such smart peripheral devices often re- an I/O bus 110. The peripheral could be directly con- 
quire streams of instructions and data. nected to DMA 302, and in turn, directly connected to 
Referring now to FIG. 1, the architecture of conven- 40 CPU-memory bus 122. The architectures shown in 
tional smart peripheral devices is shown. The streams of FIGS. 1, 2, and 3 are illustrative only, and are not in- 
instructions and/or data may either be held in one or tended to represent a detailed discussion of computer 
more specific data structure^) 101 in a main memory architectures in general. 

102, or in a separate (deep) hardware memory, which in Regardless of whether the instructions and/or 

conventional systems is usually a FIFO located in an 45 whether the streams of instructions and data are held in 

I/O controller (such as 106, 112, or 134) associated with the software FIFO data structure 101 of main memory 

(or part of) the I/O or peripheral device. 102, or are placed in a hardware FIFO buffer, there is a 

In the example shown in FIG. 1, there are three so- trade-off as to the depth of the FIFO of these conven- 

called peripherals that are shown for purposes of illus- tional systems. The software specific data structure 

tration. The first peripheral, which is designated gener- 50 approach is slow since the streams of instructions and 

ally by a reference numeral 130, is made up of an input- data must be sent over CPU-memory bus 122 via bus 

/output (I/O) controller 106, a disk drive 104, and a adapter 120 to I/O bus 110. 

disk drive 108. I/O controller 106 may or may not con- The hardware FIFO buffer approach is faster. How- 
tain internal processing functionality in the form of a ever, a deep hardware FIFO is expensive, especially in 
peripheral processor (CPU) or peripheral controller 55 a computer system using a high clock rate. As is well- 
(not shown). known, higher clock rates are constantly being used for 

The second peripheral is designated generally by a the host processor 126 to improve system performance 

reference numeral 132. Second peripheral 132 is made and increase system functionality, 

up of an I/O controller 112 and a graphics output 114, Regardless of which of the conventional systems of 

which typically is a cathode ray tube (CAT) display or 60 FIGS. 1, 2 and 3 is used for the instruction and data path 

a frame buffer, but can be any suitable graphics output between the digital processor (CPU) 126 and the pe- 

device. Like I/O controller 106, I/O controller 112 may ripheral processors found in the I/O controllers 106, 

or may not have internal processing functionality. 112, and 116, some type of FIFO system must be used, 

The third peripheral is designated generally by a either the specific data structure(s) 101 stored in main 

reference numeral 134, Third peripheral 134 is made up 65 memory 102 or a deep hardware FIFO buffer, 

of an I/O controller 116 and a network 118. Like I/O To reduce latency, it is advantageous that as much of 

controller 106, I/O controller 116 may or may not have the instructions and data that will be needed by the 

internal processing functionality. peripheral processor be present in the FIFO. Thus, a 
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conventional strategy for reducing latency is to create a 
larger (also called a "deeper") FIFO. 

However, the use of a deeper FIFO approach means 
that more instructions and/or data must be dealt with 
before the stream of instructions and/or data can be 5 
switched to another stream of instructions and/or data. 
Such a switch in the streams of instructions and data is 
called a "context switch." In other words, all of the 
instructions and/or data in the FIFO must be dealt with 
by the I/O controller 106, 112, 116 before the stream of 10 
instructions and/or data being provided to the I/O 
controller 106, 112 or 116 is context switched. This has 
the effect of increasing latency because of the extra time 
that is taken before the context switch can take place 
between streams of instructions and/or data. 15 

One conventional approach for reducing latency is to 
make the FIFO shallow. This means that less instruc- 
tions and/or data must be dealt with before the stream 
of instructions and/or data can be context switched. 
However, the use of a shallow FIFO means that the 20 
FIFO must be serviced more often. This increased ser- 
vicing consumes additional bandwidth and CPU over- 
head (for checking the state of the FIFO), which is not 
desirable. 

Thus, it would be advantageous to provide a system 25 
and method whereby the streams of instructions and/or 
data between a digital processor and a peripheral pro- 
cessor can be context switched quickly, is inexpensive 
to implement, minimizes impact on system latency, and 
reduces memory bandwidth requirements. 30 

SUMMARY OF THE INVENTION 

The present invention is a system and method which 
utilizes a shallow hardware FIFO along with system 
memory to reduce the cost of the FIFO hardware, to 35 
allow relatively fast context switches, and to decrease 
latency and memory bandwidth requirements. 

In broad terms, the present invention is a system and 
method for queuing, control and transfer of data be- 
tween a host processor and a peripheral processor using 40 
an architecture and a data flow strategy of one or more 
virtual FIFO data structures stored in main memory 
and a hardware data FIFO (which typically is physi- 
cally located on the same chip or circuit board with the 
peripheral processor) under control of the peripheral 45 
processor. One virtual FIFO at a time drives the data* 
FIFO with data. In turn, the data FIFO drives a periph? 
eral device withjthis data? As employed herein the term 
"drives*' is useff tc? mean supplying data to a device. 
Thus driving the data FIFO means supplying or passing 50 
data to the data FIFO, f f 

The host software running on the digital processor 
controls the loading of data for each process (context) 
into its associated virtual FIFO. The digital processor 
under control of the host software and system software 55 
(or firmware) controls the operation of the peripheral 
processor and the virtual FIFOs. The peripheral pro- 
cessor controls the flow of data from the data FIFO to 
the peripheral device, and under control of the host 
software, the flow of data from the driving virtual 60 
FIFO to the data FIFO. 'J ■ ■ 

Start and end address pointers for each virtual FIFO 
(stored in main memory in an associated memory block) 
indicate the memory location in the virtual FIFO where 
data is stored. The peripheral processor also keeps a? 65 
start address pointer of the memory location of the last 
unused data then read out of the data FIFO to the pe- 
ripheral device. J 



These address pointers allow the hardware FIFO to 
be flushed when a context switch occurs, which can 
(and usually does) take place before all of the data in the 
driving virtual FIFO is supplied to the data FIFO, and 
supplied by the data FIFO to the peripheral device. A 
strategy for updating the address pointers improves 
effective memory bandwidth. 

The present invention allows for main memory la- 
tency, reduces main memory bandwidth utilization, 
allows for relatively fast context switching, and reduces 
hardware cost because the storage capacity of the 
(hardware) data FIFO is small relative to the storage 
capacity of each virtual FIFO. Total storage capacity^ 
each virtual FIFO, data FIFO combination is large/ 
with the added benefit that the hardware data FIFO can 
be fast, which reduces latency. Moreover, context 
switching is relatively fast because it can be done by 
changing address pointers and by dumping the contents 
of ^0^it^e¥WQif ' 

BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention is better understood with refer- 
ence to the following drawings and the specification. 

FIG. 1 is a block architectural diagram of a conven- 
tional interface of I/O devices and an I/O bus with a 
CPU-memory bus. 

FIG. 2 is a block diagram of a conventional interface 
of the I/O connected directly to the cache. 

FIG. 3 shows in block architectural form a conven- 
tional digital processor and peripheral processor system 
having virtual direct memory access, which requires a 
register for each page to be transferred in the DMA 
controller. 

FIG. 4 is a block diagram showing at a high level the 
data flow and control paths of the present invention. 

FIG. 5 is made up of FIGS. 5A and SB. 

FIG. 5A shows a preferred virtual FIFO data struc- 
ture in accordance with the present invention where 
data does not wrap. 

FIG. SB shows a preferred virtual FIFO data struc- 
ture in accordance with the present invention where 
data wraps to the beginning. 

FIG. 6 is a flow chart showing at a high level the 
operation of the digital processor 126 under control of 
host software with the DMA controller 404 in accor- 
dance with the present invention. 

FIG. 7 is a block diagram showing the end address 
pointer strategy used for updating (or not updating) the 
end address pointer in order to save bandwidth in both 
writing and reading of the end address pointer. 

FIG. 8 is a block diagram showing the start address 
pointer strategy used for updating (or not updating) the 
start address pointer in order to save bandwidth in both 
writing and reading of the start address pointer. 

FIG. 9 is a block diagram representation showing an 
alternate embodiment of the present invention where 
host processor 126, the DMA controller 404, and hard- 
ware FIFO 402 are all fabricated on a single (microelec- 
tronic device) semiconductor chip 900. 

FIG. 10 is a block diagram showing an embodiment 
of the present invention where the DMA controller 404 
controls n (hardware) data FIFOs, where n is a positive 
integer greater than or equal to 2, which creates n chan- 
nels. 

FIG. 11 is a block diagram showing an embodiment 
utilizing two host CPUs for controlling the DMA con- 
troller 404 and the (hardware) data FIFO 402. 
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FIG. 12 is a block diagram of an alternate embodi- For illustration purposes, consider a workstation that 
ment of the present invention using a single host CPU has a number of windows currently opened. Each win- 
126 to control two or more DMA controllers with dow has its own context. Thus, when the user wants to 
associated (hardware) data FIFOs. switch between windows the workstation must save the 

FIG. 13 is a block diagram showing an alternate 5 context associated with the first window and switch to 
embodiment where a buffer (dual ported memory) is on the context of the second window, 
the I/O bus 110 so as to allow the present invention to As indicated above, control and transfer of data be- 
be utilized with a personal computer product where the tween a digital processor and a peripheral processor can 
I/O bus cannot accommodate direct memory access. *> e accomplished with only one virtual FIFO for all 

FIG. 14 is a block diagram of an embodiment of the 10 processes. Moreover, the present invention can be im- 
present invention where the hardware FIFO 402, DMA plemented with one virtual FIFO assigned to multiple 
controller 404, and graphics function stage 1402 are all processes. The hardware configuration of the present 
physically part of a peripheral 1400. invention does not restrict the use of one virtual FIFO. 

Start and end address pointers for each virtual FIFO 
DETAILED DESCRIPTION OF THE 15 (stored in main memory in an associated control block)* 

PREFERRED EMBODIMENTS indicate the*memory location in the virtual FIFO where * 

A. General Overview and Terminology infotnlatib^ ateo 

The terms host processor, host CPU, digital proces- k'eepsTasfcu-t address -pointer of the memory location of> 
sor, master CPU and CPU are often used interchange- last ^used data read out of the hardware FIFO XO f 

ably in this field. The term "host processor" is used 20 toe peripheral deviee*jFhese address pointers allow thc^ 
hereafter with the understanding that other similar driving virli?a1^HFO' lp„be^fl a context' 

terms could be substituted therefor without changing swjtch-occurs, which can take place beforejall 6f_,the> 
the underlying meaning of the disclosure. data in. the"drivmg virtual FIFO is supplied to the hard- ? 

The terms chip, integrated circuit, monolithic device, ware*FIFO, and supplied by the hardware FIFO to the 
semiconductor device and microelectronic device are 25 Penpheral device. -A strategy for updating the address 
often used interchangeably in this field. The present PO^ers -unproves system efficiency? 
invention is applicable to all of the above as they are ™ e Pf esen f inventlon aUows for latency associ- 
generally understood in the field. atcd Wlth mam meraor > r > reduces main memory band- 

The numbering convention is as follows. The right lft ^ dth utilization allows for relatively fast context 
two digits indicate the item number. The left digit or 30 swltcmn S . and reduces hardware icost because the stor- 
digits indicate the Figure where the item first appears. ff f pacity ° f th f ™^!™!!? ™ atIVC t0 

The same number in each Figure indicates the identical t 

item. Wherever possible, the Figures are discussed in °T ? , Tt hardw ^ FIp O combi- 

numerical order in the text. „ »*£ ? d .™ can be fast, 

In broad terms, the present invention is a system and 35 ^^^i^J^ h * rfad ^ y 

. r A . , . * , . fast because it can be done by changing pointers and bv 

^JST^ °T d ^ f ^ ^ d ™P in * the conten * of thetodwLe^FO * 
tween a distal processor and a peripheral processor t mvemion i$ envisioned u bei 

using ^ arc^tecture and a data flow strategy of one or m a ^ Y scale m d (VLgI) j* 

more virtual FIFO ^structures stored in mam mem- 40 sor (of theType used wh¥ the second peripheral 13?). 

S * h ^T^^^ H ™ ever ' * sh °» ld be ^erstood thatVpresent in- 

located on the same chip or circuit board with the pe- vention ^ be used m other applications ^ well . 

«S cS^\ S£ Tl Pr °" Referrin S now to FIG - 4 > which * a block diagram 

™ 5a T V dnv %^ dwarc showing the control and data flow of the present inven- 

FTFO with data. In turn, the hardware FIFO drives a 45 tion . Main me 102 include$ a ^ of £ ueues 

peripheral device with this data. The host software ^ ^mbI FIFOs. Three virtual FIFOs are shown for 

runmng on the digital processor controls the loading of purposes of illustration: they are virtual FIFO(l), which 

™I° r ^ u PrOCCSS (context > mt0 lts associated virtual is referr ed to by reference number 406; a virtual FI- 

FEFO. The host processor under control of the host F 0(2), which is referred to by reference number 40* 

software controls the operation of the peripheral pro- 50 and a virtual FIFO(3) ( which is referred to by reference 

cessor and the virtual FIFOs. The peripheral processor number 410. It should be understood that the present 

controls the flow of data from the hardware FIFO to invention contemplates one or more virtual FIFOs lo- 

tne peripheral device, and under control of the host C ated in main memory 102. 

software, the flow of data from the driving virtual The virtual FIFOs located in main memory 102 are 

FIFO to the hardware FIFO. 55 used to drive a peripheral device, referred to generally 

As employed herein the term "context" is defined as by a reference number 401. This driving by the data 

the current state of a processor. In a CPU the contextbf from the virtual FIFO in question takes place through a 

that processor would be, for example, the contents of hardware FIFO 402. It should be noted that the virtual 

the register files, contents of the status registers, and>the ; FIFO's can be incorporated into any dedicated memory 

current state of the error flags. In the case of a smart 60 (e.g., a 2-port memory on an I/O card). 

1 peripheral _the context attributes nnght include any attri- Any type of peripheral device 401 can be used by the 

butes that have been set and at what point in its pipelme present invention. In the preferred embodiment, the 

those attributes have been set. Thus, context switching? peripheral device is a graphics processor having a frame 

entails switching the states of the different processes buffer (not shown). Alternately, the peripheral device 

currently running on the computer system employing 65 401 could be a disk drive 104, or a network 118, or any 

the^presejn invention. In other words, the states of one / other output device. 

r proces^r^vedland switched with the states of an- The DMA controller is part of peripheral subsystem 

other process inside the driving virtual FIFO. 132. It may or may not have internal processing func- 
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tionality. It should be noted that DMA controller 404 
may be part of peripheral 132 (as shown in FIGS. 1, 2, 
and 3) or may be a separate peripheral device. 

DMA controller 404 is under the control of software 
(not shown) or firmware (not shown) running on the 
host processor 126. The software or firmware control 
allows host processor 126 to efficiently provide data 
and instructions to peripheral device 401 in accordance 
with the present invention. This control is via a path 428 
effectively between host processor 126 and DMA con- 
troller 404. This path will depend upon the architecture 
of the system, such as those shown in FIGS. 1, 2 and 3. 

Data FIFO 402 is the hardware FIFO referred to 
above. Data FIFO 402 is typically part of peripheral 
132. It can be a separate hardware FIFO made up of one 
or more integrated circuits (ICs), which are part of the 
board including I/O controller 112. In its preferred 
form, hardware FIFO 402 is a modest sized on-chip 
hardware data FIFO, which is part of the VLSI graph- 
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would come from the main memory 102 via CPU-mem- 
ory bus 122, bus adaptor 120, and I/O bus 110. 

Alternately, the present invention could utilize the 
architecture of FIG. 2. Specifically, DMA controller 
404 could be part of I/O controller 112. The data from 
main memory 102 could be supplied by bus adaptor 120 
and I/O bus 110. 

What should now be appreciated is that the present 
invention can be utilized with any of the computer 
architectures shown in FIGS. 1, 2 and 3. The present 
invention could also be utilized with any other com- 
puter architecture which would permit the use of vir- 
tual FIFOs in a system memory 102 or special I/O 
memory. 

Referring now to FIGS. 3 and 4, it should be under- 
stood that DMA controller 404 may also include a 
virtual memory management unit (VMU; not shown), if 
desired. _ . _ . . 

Referring now to FIG. 4, DMA controller 404 reads - 



ics processor (not shown). Any conventional fabrica- 20 the (software) virtual' FIFO that is being used to drive 
tion process can be used for building hardware FIFO hardware FIFO 402. DMA controller 404 passes com- 
402 either separately on a chip, or as part of a larger raands and data from the virtual FIFO that is driving* 
chip having additional subsystems and functionality, ^hardware FIFO 402 so that the commands and data are 
The size of the hardware FIFO depends on a variety of received by peripheral device 401/ Moreover, DMA 
system criteria. For example, if the hardware FIFO was 25 controller 404 provides status information and inter- 
16 words (4 bytes each) deep, then the virtual FIFO's 



should be 512 bytes, which should be adequate for 
lower performance 2D applications. For more complex 
3D operations the virtual FIFO's could be, for example, 
8K bytes. In addition, hardware FIFO 402 in a pre- 
ferred embodiment holds two of the largest divisible 
operations capable of being performed by peripheral 
401. However, the exact constraints of hardware FIFO 
402 depend on the latency of main memory. 

At any time in the operation of the present invention, 
only one of virtual FIFOs 406, 408 or 410 is used to 
drive peripheral device 401 using hardware FIFO 402. 
For a given hardware FIFO 402, the host processor can 
"compute ahead" and generate data for large virtual 
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rupts via a path 430 between it and host processor 126. 
Path 430 on a physical level will depend upon the com- 
puter architecture of the types as shown in FIGS. 1, 2 
and 3 that is used. 

One of the important aspects of the present invention 
is that the queues contained in virtual FIFOs 406, 408 
and 410 can be switched when the peripheral context is 
switched. The ability to switch between two or more 
contexts in a peripheral is becoming increasingly impor- 
tant as the functionality of the computer system and the 
peripheral increase. For example, windowing systems 
used in graphics processors will include two or more 
contexts. In other words, each window that is present in 
the peripheral is a separate context The present inven- 



FIFO without waiting until it actually has access to the 40 tion can easily accommodate the context switching so 



hardware. Moreover, the host processor need not 
"poll" the pointers constantly to know if it has the nec- 
essary space. Thus, two very significant advantages are 
achieved by the system and method of the present in- 
vention: a reduction in latency, and a minimization of 45 
hardware in the use of the bandwidth of main memory 
102. 

The example of the present invention shown in FIG. 
4 has a DMA controller 404. It should be understood 



that it is done relatively fast and efficiently. 

The context switching or "swaps" of the present 
invention are under control of higher level software 
that is operating on the digital processor 126. Any appli- 
cation which requires a smart peripheral which is pro- 
grammed with software has the potential of passing 
data, and thus, would benefit from the present inven- 
tion. An example of such an application would be win- 
dow system software. 



that this representative example of the present invention 50 B. Memory Address Snooping 



envisions a larger host processor and peripheral proces- 
sor architecture of the type shown in FIG. 3. Specifi- 
cally, referring again to FIG. 3, DMA controller 404 
would be a combination of I/O controller 112, and 
DMA 302 and the address-translation registers 304. 

FIG. 14 shows a block diagram of an embodiment of 
the present invention. A peripheral 1400 includes hard- 
ware FIFO 402, DMA controller 404, and a graphics 
functions stage 1402, all of which are physically part of 
the peripheral 1400. Note that DMA can optionally 
include a virtual memory management unit (VMU) 
1404, if desired. Also note that all of these components 
can be physically located on the same chip as the CPU. 

The present invention is also equally applicable to the 
architectures shown in FIGS. 1 and 2. Specifically, 65 
DMA controller 404 with reference to the architecture 
shown in FIG. 1 could be part of the I/O controller 112, 
so that the data that is supplied by host processor 126 



Referring again to FIG. 4, each virtual FIFOs 406, 
408 and 410 has an associated FIFO control block. 
Specifically, virtual FIFO 406 has a FIFO control 
block 440, virtual FIFO 408 has a FIFO control block 
442, and virtual FIFO 410 has a FIFO control block 
444. The control blocks 440, 442 and 444 are located in 
main memory 102, but could also be located in any 
available memory location in the host processor system. 

As js oUscussed in greater detail below, host processor 
126_(orlany other, processor that is controlling the pe- 
'_ripheral device 404; not shown) periodically writes to 

tte memory control block of the virtual FI^O that is ? 
^.beihg Jused-to drive the peripheral device 401. 

DMA controller 404 snoops these accesses to the 
memory blocks by digital processor 126. The word 
snoop is synonymous with the word monitor. This 
snooping by DMA controller 404 occurs via a path 424 
(labeled "read/snoop address"). 



55 



60 
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When DMA controller 404 detects that host proces- 
sor-J26 has .written to the control block of the virtual* 
FIFO that i^'cSvrni'pe'ripheral device *401; DMA con? 
troller 404 updates the end address, as discussed below. 

Snooping is not required if the host processor writes 5 
directly into DMA 404 registers, as opposed to writing* 
'into the virtual FIFO's control block! In addition, 
snooping can also be disabled if data is written directly 
into hardware FIFO 402 by the host processor. 
C. Virtual FIFOs 10 

FIG. 5 illustrates a preferred virtual FIFO 406, 408, 
410 data structure in accordance with the present inven- 
tion. FIG. 6 is a flow chart which shows at a high level 
the control of the data queues (virtual FIFOs 406, 408 
and 410). Once again note that DMA controller 404 anft 15 
the software running on host processor 126 jointly Con- 
trol the virtual FIFOs (data queues') 406, 408 and 410 
(shown in FIG. 4). 

Specifically, as indicated in block 602, the host soft- 
ware (not shown) running on host processor 126 sets up 20 
(i.e., initializes) the data queue (virtual FIFO) driving 
peripheral device 401, as indicated by a step 604. Next, 
the host software sets up DMA controller 404, as indi- 
cated by a step 606. 

In a step 608, the host software instructs DMA con- 25 
troller 404 to use the data queue. In the usual case, 
certain data queues, as discussed below, are assigned to 
particular processes. The host software knows which 
data queue to use for a particular process. However, as 
indicated above, a single virtual FIFO can be used for 30 
multiple processes. 

In a step 610, host processor 126 places data in the 
data queue that the host software has instructed to be 
used. For purposes of illustration, and referring again to 
FIG. 4, assume that the data queue being used for the 35 
discussion below is virtual FIFO(2) 408. Note that the 
associated control block for virtual FIFO(2) 408 is 
memory block 442. Also note that the discussion below 
applies to any virtual FIFO and not just virtual FIFO(2) 
408. 40 

Host processor 126 changes the end address in con- 
trol addresses memory block 442 for data queue 408, 
this change is indicated by a block 612. DMA controller 
404 then reads the new end address placed by the digital 
processor 126 into the control addresses memory block, 45 
as indicated by block 614. DMA controller 404 then 
reads commands and parameters from data queue/408. 
This step is indicated by a block 616. 

Next, DMA controller 404 writes a new start address 
in the control block 442 each time it reads a block of/50 
data from virtual FIFO 4087 Note that the DMA con- 
troller writes the new start address each time a block of 
data has been read. (See FIG. 8 for start address pointer 
strategy.) This is important in connection with context 
switching, discussed below. This step of writing a new 55 
start address is indicated by a block 618. 

Finally, DMA controller 404 snoops the end address 
present in the control block of virtual FIFO 408. This is 
indicated by path 424 of FIG. 4. This step in FIG. 6 is 
indicated by a step 620. DMA controller 404 reads the 60 
end address in the memory block if it determines that it 
has been modified. As discussed below, this allows for 
context switching to occur on a relatively fast basis. 
, At any time, host processor 126 can place data into* 
the data queue being used if there is space in the data^65 
queue./This indicated by a block 622, in FIG. 6. It is 
important to note that the host processor can place data 
at any-time into the data-queue. This increases sysfems 
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throughput and response time. It also acts to reduce 
latency. - 

At any" time, DMA controller 404 can remove data 
from the data queue being used as long as there is data 
available in the data queue. This is indicated by a block 
624 in FIG. 6. Thus, peripheral device 401 can be pro- 
vided with data from its associated data queue as long as 
there is data available. This acts to reduce latency. 

When there is no data in a particular data queue, the 
start address and the end address stored in the control 
block will be the same. Referring now to FIG. 5, a 
preferred data structure for the data queues (virtual 
FIFOs) is shown. Referring to FIG. 5A, it shows that 
the data (labeled valid data) in the data queue begins at 
a memory location indicated by the start address, and 
ends at a higher numbered memory location indicated 
by the end address. The start address is indicated by a 
reference number 502, and the end address is indicated 
by a reference number 504. The start of data is indicated 
by a reference number 506, and the end of data is indi- 
cated by a reference number 508, As shown, the data is 
written in queue addresses beginning with a low address 
and going to a high address. 

As discussed below, the DMA controller reads data 
starting at the start address, as indicated by a reference 
numeral 510. In contrast, as discussed below, host pro- 
cessor 126 adds data to the data queue beginning at end 
address 504, as indicated by a reference numeral 512. 

In the example shown in FIG. 5A, the amount of 
valid data contained in the data queue and the physical 
location of this data is such that the data does not have 
to wrap around to the beginning of the data queue. In 
other words, the valid data does not extend over the top 
of the queue. 

FIG. SB illustrates an example where the data wraps 
to the beginning of the data queue. Referring now to 
FIG. 5B, it is seen that start address 502 indicates that 
the start of the data is at a mid address in the data queue. 
This typically occurs after DMA controller 404 has 
read data out of the data queue. The valid data begin- 
ning at the summary location indicated by the start 
address 502 is written to the top of the queue, which has 
the high address for the queue. Since there is more valid 
data than can be accommodated, the additional data is 
written into the data queue beginning at the low ad- 
dress, and continuing up to the end of data, as indicated 
by the end address 504. It is thus seen in the example, 
that the data is wrapped around. This wrap around 
approach allows for host processor 126 and the DMA 
controller 404 to read data into and write data out of, 
respectively, the data queue at any time, as discussed 
above in connection with step 622 and 626. DMA con- 
troller 404 has logic incorporated to handle the queue 
wrap around addressing. 

It should be understood that the present invention 
also can accommodate other software data structures 
for the virtual FIFOs. For example, the start address 
and end address approach of FIG. 5 could be utilized in 
a situation where the data is written from the high ad- 
dress to the low address, as opposed to the low address 
to the high address as shown in the example. In this 
way, the data would wrap around at the bottom as 
opposed to wrapping around at the top, as shown in the 
example at FIG. 5B. The important concept is the use of 
the start address 502 and the end address 504 indicating, 
respectively, the start of data and the end of data in the 
data structure of the data queue (virtual FIFO). 
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As stated, when the start pointer (indicated by the The use of these two start pointers by DMA control- 
start address 502) reaches the top of the data queue, it is ler 404 allows hardware FIFO 402 to be flushed (that is, 
reset to the bottom of the data queue before the next any data remaining in the hardware data queue 402 is 
data entry can be made into the data queue. The data discarded) when a context switch has been requested 
queue wraps around to the bottom after reaching the 5 without reading the Ha tfl remaining in hardware FIFO 
t0 P- 402 into another memory buffer (not shown), or actu- 

As shown in the example of FIG. 4, there are three ally writing it back into the data queue of main memory 

data queues (virtual FIFOs) in main memory 102. This 102. This acts to significantly decrease latency and sys- 

allows for context switching between processes being tern memory bandwidth during context switching in 

run by the host software to be performed by the present 10 accordance with the present invention, 

invention. Specifically, each process being run by the The end address written into memory holds the end 

host software may have its own virtual FIFO (data of the data actually consumed by hardware FIFO 402. 

queue). Thus, each process is not affected by the other In other words, the end address written into memory is 

processes. not the address of the last data item that was read into 

In this way, relatively fast context switching can be 15 the hardware data FIFO 402. 

performed by the present invention because the various It should be understood that the cost of this aspect of 

processes may put data into their respective data queues the present invention is that some data items remain in 

even when the peripheral hardware is servicing another the virtual FIFO while it is actually in hardware FIFO 

process through the virtual FIFO associated with the 402 as well. When the particular process (context) is 

process being serviced. This acts to reduce latency 20 resumed, DMA controller 404 is loaded with the start 

significantly. pointer which holds the last unused data item that was 

Referring to FIG. 4, context memory blocks 412, 414 read out to peripheral device 401. 

and 416 located in main memory 102 hold state informa- Note that the pointers associated with a queue are 

tion associated with a context (process). Note that only saved when the queues are switched, 

three context memory blocks are shown. This is for 25 D. Context Buffer 

illustration only. The present invention contemplates As discussed above, the context (processes) are stored 

allocating any number of context memory blocks in in the context memory locations 412, 414 and 416 when 

main memory 102. When a context is switched the cur- not in use. 

rent context is read out of the virtual FIFO and stored The context may be copied, and restored from system 

in a context memory block 421. Subsequently, another 30 memory buffers (Context No. 1-3), where they are 

context is loaded from memory into that particular stored when not in use. The buffers hold enough data to 

virtual FIFO. restart an interrupted context. Each process may have 

Typically, the virtual FIFO (data queue) is swapped one or more contexts which may be swapped. The 

when a context switch is performed by the host soft- context is saved when an acceptable point in the data 

ware operating on the digital processor 126. However, 35 has been reached. 

this does not necessarily have to take place. For exam- Peripheral devices 401 often require that certain sets 
pie, if two processes are associated with the same data of data be used together and that the context of the 
queue, such a swap would not take place. An instruction device may not be swapped unless the whole packet of 
in the data stream would signal the hardware to syn- data is received and handled in some way by the periph- 
chronously switch contexts. A preferred embodiment 40 eral. If there are requirements of this type, the DMA 
would not flush the virtual FIFO. Instead, it would just device or the DMA controller will need to examine the 
continue reading the data stream. All the instructions output of the hardware FIFO to determine when con- 
ahead of the switch instruction would complete there text of the peripheral device may be switched, 
operations. Subsequently, a set of instructions would be E. Signaling Mechanisms 

sent down the pipeline to start saving contexts as the 45 DMA controller 404 provides two signaling mecha- 

different stages become idle. These operations are all nisms in accordance with the present invention. As 

synchronous. Thus, it should be understood that the shown in FIG. 4, these are provided by path 430 from 

present invention does not require that the data queues DMA controller 404 to the digital processor 126. 

be swapped in order for context switching to be per- The first signaling mechanism provided by the DMA 

formed. 50 controller 404 is interrupts. The second is status, which 

Referring now to FIG. 4, the hardware FIFO dis- is poling (reading a status register and testing the con- 
cards any data remaining in it when the context is tents) by host processor 126 of the DMA controller 404. 
switched. This can be done by the present invention It should be noted that host processor 126 may enter 
since the data that is being discarded from hardware a "signal** primitive into the data queue which generates 
FIFO 402 is still in the virtual FIFO (data queue) associ- 55 an interrupt to the digital processor 126. Host processor 
ated with the process. This architecture of the present 126 may use this primitive to synchronize the software 
invention reduces the amount of state information to operations in the hardware, 
which must be saved during context switches. F. Pointer Control Strategy 

This functionality of the present invention is accom- The FIFO pointers shown in FIG. 5 used in the pres- 

plished as follows. DMA controller 404 keeps (stores 60 ent invention are jointly managed by host processor 126 

and maintains) two pointers associated with the start and DMA controller 404. It should be noted that these 

address. The first pointer (fetch pointer) specifies the pointers do not have to be exactly current in accor- 

memory location of the next data item in the data queue dance with the present invention. System efficiency will 

being used which is to be loaded into the data FIFO be higher if the pointers are updated only when neces- 

402. DMA controller 404 also keeps a start pointer 65 sary. The present invention contemplates and takes 

(start pointer), which specifies the memory location of advantage of this. Note that the sizes of the virtual 

the data item in the data FIFO 402 which is next to be FIFOs in the system memory 102, and the data FIFO 

read out to peripheral device 401 via a path 418. (hardware) 402 and the amount of data required for 
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proper operation of the peripheral device 401 must be It should be understood, however, that the present 

taken into account when controlling these pointers. invention is not limited to this strategy for updating the 

Host processor 126 initializes start address 502, but data pointers. Any suitable strategy for updating the 

manages end address 504. DMA controller 404 initial- data pointers is contemplated by the present invention, 

ues end address 504, but manages start address 502. In 5 G. Variations And Alternate Embodiments In The Sys- 

other words, the host processor 126 is the writer of the tem And/Or Method 

start address, while the peripheral processor 401 (or In one alternate embodiment, host processor 126, 

DMA controller 404) is the reader. The peripheral pro- DMA controller 404, and hardware FIFO 402 can all 

cessor 401 (or DMA controller 404) is the writer of the physically be present on a single microelectronic device 

end address, while the host processor 126 is the reader. 1° $0°, as indicated in block diagram form in FIG. 9. As 

Each time a start address pointer or an end pointer is shown in FIG. 9, the various data and control lines 

modified in the control addresses memory block in the between these three functional blocks can be fabricated 

system memory 102, the reader of the particular pointer of the single microelectronic device, and an 

may read it. This is important for control in the present input/ output stage can provide the necessary control 

invention. 15 ^ data paths between these three functional blocks 

In order to save bandwidth in both writing and read- ^ nd main memory 102, peripheral device 404, and the 
ing of the start and end address pointers, the following 

strategies discussed below in connection with FIGS. 7 lt . should be understood that any suitable type of 

and 8 can be used in accordance with the present inven- fabrication technology and process can be used for 

tion: 20 implementing this alternate embodiment of the present 

The start address pointer strategy is as follows: invention. 

1. The start address is updated if the amount of data DM ^ contro »er 404 can be used to control two or 
left in the virtual FIFO is less than the maximum m0 ™ channels * Referring now to FIG. 10, DMA con- 
increment which is fetched from the virtual FIFO ? oUqt ^ contro1 n channels » where n « a positive 
in a transaction, as indicated by a block 802. This 25 int f gcr = than \ Each channeI has an associated pe- 
keeps the memory bandwidth efficient. n . phe J^ d * vlce ^p ch Peripheral 401* has an asso- 

2. The start address is updated if the context is Ciated hardwa ^ e FIF ° n ' which is filled up from main 
switched, as indicated by a block 804; EE?* M2 ./ or memories) under control of 

3. The start address is updated if the data FIFO « u^au a ♦ a u t ^ _ T _ _^ . 
(queue) 402 is empty, as indicated by a block 806; 30 * T *l I * " 

4. The start address is updated if the amount of data J^^S? C vff ^ ? ^[* ^I^X* 
fMH fr _ fha . to PT p n , _ , AM • _ , . electronic device which also has DMA controller 404, 
read rrom the data FIFO (queue) 402 since the last ^ * «v,.„: m ik * j r t-xw * ^ „ 

time the start address was updated is large (for be phyS ' CaUy from DMA c ° ntroller 

oZSl^diShv f hJf e v1S5f h ° f 35 ™« em "odiment is particularly useful in a situation 

queue), as mdicated by a block 810 where mu]ti le ri ^ devi / for j 

S-Ifrequestedbythehostprocessorthestartaddress ^ ^ £ J n ? , ^ Qf da £ ^ 

^ht%^ JT^ % ^ S \ P S° r contml of a ™& DMA controller 404. It could also be 

ZftLTtf ?V£ ! "1° * «W m a Processor situation where the 

empty and the : S Urt address pomter was pomnngat n mcs or W0 £ Id fee We f ^ Qn 

fnt ^« r 6 "'/^? 16 ^^ 0 !^ or more P rocesses (context) or multipk communi- 

the host processor hassUrted loading the data. cations channels ( „ e ,work interfaces) at 

The end address pointer strategy is as follows: ^ t same time B r 7 

L J*? f n * not updated if only part of an The preseM mvention ^ contemplates an embodi- 

* .'Tt^T °v n bee " entered . T ^ T 45 ment where two or more digital processors are control- 

S*^ u ™, ^? UeUC) m m ! m0ry -' f l ? d J ,Ca " d ^* lin S DMA controller 404. Referring now to FIG. 11, 

block 702. This prevents the peripheral device 401 DMA controller 404 is controlled by host processor 

from starting an indivisible operations) in the mid- 12 6, and also by a second host processor 1126. (Note 

die. In other words, if the end pointer can only ^ ^ numb ering used in FIG. 11 shows the corns- 

point to the end of indivisible operations, DMA 50 sponding blocks and lines between the two digital pro- 

404 will not feed operations which are not com- cessors as indicated by the right two-most digits of the 

plete to peripheral 401. This strategy also prevents reference number; for example, host processor 1126 is 

system lock-up if the writing process in the digital comparable to the host processor 126). 

processor 126 is suspended before completing the Note that as shown in FIG. 11, each host processor is 

entry into the virtual FIFO (queue). 55 controlling its own system memory: system memory 

2. The end address is written into the memory block 102 for host CPU 126, and a system memory 1102 (not 
(updated) if (1) the stream of data is complete, as shown) for host CPU 1126. Data from each of these 
mdicated by a block 704; or (2) if a large block of memories in the virtual FIFOs in these memories is 
data (for example, one-fourth of the depth of the provided to the (hardware) data FIFO 402 by lines 426 
data queue) has been written into the virtual FIFO 60 and 1126, respectively. In this way, hardware FIFO 402 
since the last time the end address was updated as can be supplied with data under control of each of the 
indicated by a block 706. host CPUs 126 and 1126. 

3. If virtual FIFO is full (no space available) then the . As is well known, any control scheme can be utilized 
end address pointer is updated so that the hardware between host CPU 126 and host CPU 1126 so that they 
can do maximum fetching. 65 are not asking DMA controller 404 and hardware FIFO 

The end address should always be updated so divisible 402 to operate on their data at the same time, 

operations to the hardware are maintained between the An alternate embodiment associated with FIG. 11 is 

start and end pointers. to have host CPU 126 being provided with data directly 
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by another host CPU (not shown). This would be par- 
ticularly advantageous in a graphics situation where the 
other host CPU would be working on or retrieving data 
for host CPU 126 in a background mode. This data 
would then be provided to host CPU 126 after it had 5 
been generated, or had been fetched. 

Thus, it should be understood that the present inven- 
tion contemplates embodiments utilizing two or more 
host CPUs, either directly controlling DMA controller 
404, or cascaded with respect to each other. 1° 

The present invention also contemplates an embodi- 
ment where a single digital processor having a single 
host CPU 126 controls two or more DMA controllers 
with associated (hardware) data FIFOs. An example of 
this embodiment is shown in block diagram form in 15 
FIG. 12. As shown, there are two DMA controllers: 
DMA controller 404 and DMA controller 1204. [Note, 
the numbering convention used in FIG. 12 indicates 
similar block diagrams and lines by the same right two- 
most digits; for example, DMA controller 1204 is simi- 20 
lar in structure and operation to the DMA controller 
404.] 

Thus, a single host processor 126 can control the 
operation of two peripheral devices 401 and 1201, each 
under control of its respective DMA controller 404, 
1204. 

As shown in FIG. 11, there arc two streams of data 
that are being supplied to hardware FIFO 402: stream 
426 and stream 1126. DMA controller 404 can keep 3Q 
track of these streams of data in the single hardware 
FIFO using either tags or other identification strategy. 
The present invention contemplates an embodiment 
where hardware FIFO 402 is provided with two or 
more streams of data, as opposed to being limited to 35 
only those embodiments where each hardware FIFO 
only has a single stream of data. Tagging or other suit- 
able identification can be used by DMA controller 404 
to identify from which stream the particular data has 
come. 43 

Typically, the main memory 102 is the main memory 
of the digital processor. However, the present invention 
can also be utilized in situations where there is no direct 
access to the main memory 102. 

An embodiment which allows the present invention 45 
to be implemented where direct access to the main 
memory 102 is not possible is shown in FIG. 13, A 
buffer (also denominated a dual ported memory) 102' 
has one port connected directly to the I/O bus 110. The 
host CPU 126, DMA controller 404 and the hardware 50 
FIFO 402 (all of which arc not shown in FIG. 13) ob- 
tain access to the virtual FIFOs, and the associated 
control addresses via the lines indicated by the series 
400 numbers. 

Note that buffer 102' could be a single ported mem- 55 
ory, where both the host processor and DMA control- 
ler 404 could gain access to the data and other informa- 
tion stored in it. However, the preferred embodiment is 
that of a dual ported memory. 

This embodiment, shown in FIG. 13, is particularly 60 
advantageous in some personal computer environments. 
In such situations, the present invention is sitting on I/O 
bus 1110, but cannot be a bus master. For example, on 
the IBM AT Class personal computer machines, there is 
no direct memory access by peripherals, other than the 65 
standard internal DMA controller which does not sup- 
port the virtual FIFO functionality, like there is on 
some of the newer, more robust architectures. The pres- 



ent invention therefore can overcome this deficiency 
without modification to the PC machine itself. 
H. Conclusion 

Thus, it is seen that the present invention allows for 
relatively rapid swapping of peripheral device contexts. 
It also presents the appearance of multiple FIFOs to the 
host software. The software generating the data to be 
presented to peripheral device 401 need not be con- 
cerned that other devices are also using peripheral de- 
vice 401. Multiple "client" processes, for example, can 
time multiplex the use of a hardware peripheral device 
401, but continue to create data and commands and 
place them into their "virtual FIFO's" until the virtual 
FIFO's are full. 

Finally, the present invention allows very large data 
queues to be created with very modest hardware cost 
and with very modest use of memory bandwidth. 

While the invention has been particularly shown and 
described with reference to preferred embodiments 
thereof, it will be understood by those skilled in the art 
that various changes in form and details may be made 
therein without departing from the spirit and scope of 
the invention. 

What is claimed is: 

1. A virtual FIFO peripheral interface system for use 
with a host processor and a peripheral processor in 
connection with queuing, control and transfer of data 
from the host processor to a peripheral, comprising: 

(a) at least one virtual FIFO data structure located in 
a main memory; 

(b) a control addresses block associated with each of 
said at least one virtual FIFO data structure; 

(c) a hardware FIFO, coupled to said virtual FIFO 
data structure, and controlled by the peripheral 
processor, which supplies data to the peripheral; 

(d) a direct memory access (DMA) controller, cou- 
pled to said hardware FIFO and the host proces- 
sor, which controls the data transfer from said at 
least one virtual FIFO data structure to the periph- 
eral processor; 

(e) wherein said control address block includes a start 
address pointer and an end address pointer, 
wherein said start and end address pointers indicate 
the memory location in said associated virtual 
FIFO data structure where information is stored, 
said start address pointer and said end address 
pointer are managed jointly by the host processor 
and said DMA controller; and 

(f) means for updating said start address pointer inj 
, said control address block according to the follow- 

' ing protocol: 7 

(i) updating said start address pointer if the amount 
of data left in said virtual FIFO data structure,*! 
less than the maximum increment which can b£ 
fetched from said virtual FIFO data structure in 
a single memory transaction; f 

(ii) updating said start address pointer if the context 
is switched in said associated virtual FIFO data 
structure; 

(iii) updating said start address pointer if said hard- 
ware FIFO is empty; 

(iv) updating said start address pointer if the 
amount of data read from said hardware FIFO 
since the last time said start address pointer was 
updated is substantially large; and 

(v) updating said start address pointer if requested 
by the host processor. 
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2. The system of claim 1, wherein the host processor 
controls means for loading data into each of said at least 
one virtual FIFO data structure. 

3. The system of claim 1, wherein said hardware 
FIFO is physically located on the same chip as the 5 
peripheral processor. 

4. The system of claim 1, wherein the peripheral 
processor includes a read address pointer which points 
to the last unused data read out of said hardware FIFO 

to the peripheral processor. io 

5. The system of claim 1, wherein the storage capac- 
ity of said hardware FIFO is substantially smaller than 
said virtual FIFO data structure. 

6. The system of claim 1, wherein the peripheral 
processor is a graphics processor, and wherein said 15 
graphics processor includes said DMA controller and 
said hardware FIFO. 

7. The system of claim 1, wherein the host processor 
contains a host software means for initializing said vir- 
tual FIFO data structure and said DMA controller. 20 

8. The system of claim 1, wherein said DMA control- 
ler contains a fetch pointer which specifies the memory 
location of the next data item in said virtual FIFO data 
structure to be loaded into said hardware FIFO and a 
start pointer which specifies the memory location of the 25 
data item in said hardware FIFO which is next in line to 
be read by the peripheral processor. 

9. The system of claim 1, wherein said main memory 
includes at least one context memory block for storing 
information concerning the contexts currently operat- 30 
ing in the peripheral processor. 

10. The system of claim 1, further comprising means 
for updating said end address pointer. 

11. The system of claim 1, wherein said hardware 
FIFO provides data to a first of n channels, and further 35 
comprising n— 1 additional hardware FIFOs, coupled 

to said DMA controller, for providing data to the other 
respective n — 1 channels, where n is an integer greater 
than 1. 

12. The system of claim 1, wherein said host proces- 40 
sor comprises a first host processor, and further com- 
prising at least one additional host processor cascaded 
with said first host processor for providing data to said 
first host processor. 

13. The system of claim 1, further comprising at least 45 
one additional parallel host processor directly con- 
nected to said DMA controller. 

14. The system of claim 1, further comprising at least 
one additional DMA controller coupled to said host 
processor for controlling the transfer of data from said 50 
host processor to an associated hardware FIFO which 
holds said transferred data for an associated peripheral. 

15. A method for transferring information between a 
host processor and a peripheral processor, comprising 
the steps of: 55 

(1) initializing a set of virtual FIFO data structures 
and associated control addresses blocks located in a 
main memory, each of said associated control ad- 
dresses blocks having start and end address point- 
ers, and initializing a direct memory access (DMA) 60 
controller, said DMA controller being coupled to a 
host processor and a hardware FIFO; 

(2) selecting one of said virtual FIFO data structures 
and instructing said DMA controller to use said 
selected virtual FIFO data structure; 65 

(3) storing information in said selected virtual FIFO 
data structure and changing an end address pointer 
located in said control addresses block associated 



with said selected virtual FIFO data structure to 
correspond to said stored information; 

(4) transferring data from said selected virtual FIFO 
data structure into a hardware FIFO; 

(5) updating said start address pointer located in said 
control addresses block associated with said se- 
lected virtual FIFO data structure according to the 
following protocol: 

(i) updating said start address pointer if the amount 
of data left in said virtual FIFO data structure is 
less than the maximum increment which can be 
fetched from said virtual FIFO data structure in 
a single memory transaction; 

(ii) updating said start address pointer if the context 
is switched in said associated virtual FIFO data 
structure; 

(iii) updating said start address pointer if said hard- 
ware FIFO is empty; 

(iv) updating said start address pointer if the 
amount of data read from said hardware FIFO 
since the last time said start address pointer was 
updated is substantially large; and 

(v) updating said start address pointer if requested 
by the host processor; and 

(6) transferring data from said hardware FIFO to the 
peripheral processor; 

wherein said DMA controller is configured for reading 
said end address pointer in said control addresses block 
associated with said selected virtual FIFO data struc- 
ture and writing a new start address into said start ad- 
dress pointer located in said control addresses block 
associated with said selected virtual FIFO data struc- 
ture. 

16. The method of claim 15, further comprising the 
steps of snooping said end address pointer and reading 
said end address pointer if it has been modified. 

17. The method of claim 15, further comprising the 
step of updating said end address pointer located in said 
control addresses block associated with said selected 
virtual FIFO data structure according to the following 
protocol: 

(i) not updating said end address pointer if only part 
of an indivisible operation has been entered into 
said associated virtual FIFO data structure; 

(ii) updating said end address pointer if the stream of 
data is complete; 

(iii) updating said end address pointer if a substan- 
tially large block of data has been written into said 
virtual FIFO data structure since the last time said 
end address pointer was updated; and 

(iv) updating said end address pointer if said virtual 
FIFO data structure is full. 

18. The system of claim 17, wherein said first DMA 
controller is connected to a first hardware FIFO, and 
the system further comprising a second DMA control- 
ler, said second DMA controller connected to a second 
hardware FIFO. 

19. A virtual FIFO peripheral interface system for 
use with a host processor and a peripheral processor in 
connection with queuing, control and transfer of data 
from the host processor to a peripheral, comprising: 

(a) at least one virtual FIFO data structure located in 
a main memory; 

(b) a control addresses block associated with each of 
said at least one virtual FIFO data structure; 

(c) a hardware FIFO, coupled to said virtual FIFO 
data structure, and controlled by the peripheral 
processor, which supplies data to the peripheral; 
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(d) a direct memory access (DMA) controller, cou- 
pled to said hardware FIFO and the host proces- 
sor, which controls the data transfer from said at 
least one virtual FIFO data structure to the periph- 
eral processor; 5 

(e) wherein said control address block includes a start 
address pointer and an end address pointer, 
wherein said start and end address pointers indicate 
the memory location in said associated virtual 
FIFO data structure where information is stored, 10 
said start address pointer and said end address 
pointer are managed jointly by the host processor 
and said DMA controller; and 

(0 means for updating said end address pointer in said 
control address block according to the following 15 
protocol: 

(i) not updating said end address pointer if only 
part of an indivisible operation has been entered 
into said associated virtual FIFO data structure; 

(ii) updating said end address pointer if the stream 20 
of data is complete; 

(iii) updating said end address pointer if a substan- 
tially large block of data has been written into 
said virtual FIFO data structure since the last 
time said end address pointer was updated; and 25 

(iv) updating said end address pointer if said virtual 
FIFO data, structure is full. 

20. The system of claim 19, wherein said DMA com- 
prises a snooping means for snooping said control ad- 
dresses block for a change in said end address pointer. 30 

21. A method for transferring information between a 
host processor and a peripheral processor, comprising 
the steps of: 

(1) initializing a set of virtual FIFO data structures 
and associated control addresses blocks located in a 35 
main memory, each of said associated control ad- 
dresses blocks having start and end address point- 
ers, and initializing a direct memory access (DMA) 
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controller, said DMA controller being coupled to a 
host processor and a hardware FIFO; 

(2) selecting one of said virtual FIFO data structures 
and instructing said DMA controller to use said 
selected virtual FIFO data structure; 

(3) storing information in said selected virtual FIFO 
data structure and changing an end address pointer 
located in said control addresses block associated 
with said selected virtual FIFO data structure to 
correspond to said stored information; 

(4) transferring data from said selected virtual FIFO 
data structure into a hardware FIFO; 

(5) updating said end address pointer located in said 
control addresses block associated with said se- 
lected virtual FIFO data structure according to the 
following protocol: 

0) not updating said end address pointer if only 
part of an indivisible operation has been entered 
into said associated virtual FIFO data structure; 

00 updating said end address pointer if the stream 
of data is complete; 

(iii) updating said end address pointer if a substan- 
tially large block of data has been written into 
said virtual FIFO data structure since the last 
time said end address pointer was updated; and 

(i v) updating said end address pointer if said virtual 
FIFO data structure is full; and 

(6) transferring data from said hardware FIFO to the 
peripheral processor; 

wherein said DMA controller is configured for reading 
said end address pointer in said control addresses block 
associated with said selected virtual FIFO data struc- 
ture and writing a new start address into said start ad- 
dress pointer located in said control addresses block 
associated with said selected virtual FIFO data struc- 
ture. 
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